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ABSTRACT 


This thesis examines the nature of the repairable part demand- 
carcass return relationship at the Navy Ships’ Parts Control Center and 
proposes an alternate forecasting method for predicting the number of 
carcasses arriving at the Depot Overhaul Points in the coming quarter. 
The analysis begins with graphical and correlation review of the variables 
and then uses regression, and exponential smoothing to model carcass 
returns. These methods are compared to the model currently in use by an 
analysis of the forecasts errors. The comparison suggests that the 
exponential smoothing model results in the most accurate forecasts. The 


thesis concludes with some recommendations for model implementation. 
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I. INTRODUCTION 


A. BACKGROUND 

The Navy Ships’ Parts Control Center (SPCC) is the cognizant Inventory 
Control Point (ICP) for all non-aviation consumable and repairable spare parts. 
Given the number of applications in the fleet, the demand history, and any 
known technical information/problems associated with a particular part, the 
Inventory Manager (IM) can reasonably predict the number of parts required 
for the short run future, over the next four quarters. This forecast requires the 
IM to specify the manner in which a part will become available to fleet units 
when requested: 1) stock on hand, 2) new procurement contracts, or 3) a 
repaired unit from Navy organic or commercial Depot Overhaul Points (DOPs). 
Efficient management of repairables requires SPCC to forecast repair quantities 
at the DOPs. The limiting factor for the number of repairs, is generally the 
quantity of carcass returns to the depot(s). Once a repairable item is 
determined to be faulty, it is pulled out of the mechanical/electrical system and 
the ship or station requisitions a Ready for Issue (RFI) replacement. When the 
requisition is forwarded into the supply system, a demand is registered at 
SPCC. Although the unusable component is supposed to be shipped off-station 
at the time of requisitioning, the carcass can take two to three quarters to arrive 
at the DOP. This uncertainty in carcass return quantities makes it difficult for 
SPCC to budget for repairs. 

Currently, SPCC uses the naive method (demand from the previous 


quarter factored by repair survival rate and wearout rate) to determine the 








quantity of a particular repairable to use in repair budget calculations and 


inventory level determination [Ref. 1:p. 3-A-31]. Unfortunately, this estimate is 
a gross simplification of the carcass return process, widely variable, and leads 
to inaccurate budget estimates. Although funds can be reallocated during the 
course of a fiscal year, a good estimate of the number of carcass returns would 
assist planning in both the budget and inventory arenas [Ref. 2]. 

SPCC recognizes that carcass returns are a function of more than just the 
previous demand observation, but has not investigated the relationship. 
Intuitively, the quantities of demands and carcass returns from several previous 
quarters directly influence the number of carcass returns which can be 
expected at the DOP in the coming quarter. The ICP weapon system file holds 
the past eight quarterly observations for these variables, and is easily accessible 
to an IM or budgeteer for purposes of forecasting the quantities of carcass 


returns for particular stock numbered part. 


B. OBJECTIVES 

There are two objectives of this thesis. The first is to explore and delineate 
relationships between selected data elements contained in the SPCC repairable 
data file. For purposes of analysis, a snapshot of this file was taken of 


approximately 80,000 repairable items and contained the following data fields: 


Demand - eight past quarters. 

Carcass returns- eight past quarters. 

Net Price - price to customer with a returned carcass. 
Standard Price - price to customer without a returned carcass. 


Repair Price - contractual cost of repair. 





Wearout Rate - (or 1 - survival rate) from repair process. 

Mark Code - classification by quarterly demand and cost. 
The second purpose is to compare different forecasting methods to predict 
carcass returns at the DOP. The naive method will be used as a basis for 


comparison of the effectiveness of the forecasting techniques. 


C. PREVIEW 

Chapter II will describe the repairable carcass return process and, Chapter 
III will briefly examine the general layout of the data file and establish the 
relationships upon which this thesis is based. Chapter IV will detail different 
forecasting methods, discuss, and compare their results. Chapter V will 
summarizes the previous chapters, present conclusions and make 


recommendations regarding further research. 





Il. SYSTEM DESCRIPTION 


When a new combat system is introduced into the Navy, the method of 
support must be clearly defined. Secondary items generally fall into two 
categories: a) repairables - items which can be reworked or overhauled and put 
back into service, and b) consumables - items which are discarded upon failure. 
Initial expense and ease of procurement/production are concerns that the 
system command ponders when the weapon/combat system maintenance plan 
is being developed [Ref. 1:pp. 3-58-59]. The maintenance plan not only 
declares that an assembly/ part is repairable, but also specifies the location 
where the repair is to be accomplished. The Weapon System Planning 
Document (WSPD) declares the number and location of all current and future 
systems in fleet use, and projects the funded annual employment of an average 


system. 


A. ICP PERSPECTIVE 

It is important for both the IM and the SPCC budgeting sections to be able 
to estimate the number of Not Ready For Issue (NRFI) carcasses that will be 
repaired in the coming quarter. Inadequate amounts of funds allocated for the 
repair of a part/system can cause delays in the initiation of repair and fulfillment 
of customer requirements. Additionally, the ICP must reallocate funds from 
other areas. If too much money is slotted for the repair of repairables, other 
ICP functional areas suffer. 

In addition to the system maintenance plan and WSPD, the IM uses 


technical data about system/component failure rates, production and 














procurement leadtimes, available (RFI and NRFI) assets, historic demand and 
carcass return data to take action to fill customer requisitions. Figure 1 
illustrates the dynamics of the repairable material flow process. The IM must 
satisfy all: a) recurring demand (customer demands generated by in use failures 
and stock replenishment; b) initial outfittings and allowance changes; and c) 
parts needed in the repair of a higher level assembly. Possible sources of 
supply to fill requisitions for repairables are: a) procurement contracts, b) 
supply system stock on-hand, and c) items being repaired at the DOP. Efficient 


management of the repair process is essential to the item manager since, in 


End User 


ACRs/Outfittings 
Rep of Repairables 


Demand 





Figure 1 - Requirements and Sources of Supplies 


many cases, he/she will only be able to maintain adequate system stock of a 


particular part by receiving repaired items from the DOP. 











When the IM reviews the requirements needed in the near future, a critical 
piece of information is the number of NRFI carcasses which will arrive at, and 
be repaired by the DOP. In the long run, this number is simply the quantity of 
demand, less items which are lost or damaged beyond repair. In the short run, 
however, carcasses arrive at the DOPs in a less deterministic pattern. 
Currently, SPCC uses the demand quantity for the previous quarter for an 
estimate of the number of carcasses expected to arrive at the DOP. In reality, 
the NRFI part may take several quarters to arrive at the repair location. This 
time lag is possibly created by remote locations of end users unexpedited 
shipments/transshipment, etc. Once an item arrives at and is registered on the 
records of the DOP, the repair process can be managed and, if necessary, 
expedited. The one missing piece of information from the IM’s perspective 
(other than a deterministic demand figure) is the number of NRFI carcasses 


that will be available for repair in a given quarter. 


B. MATERIAL FLOW 

A teview of the requisition-material flow process can highlight some of the 
reasons for the randomness of the carcass returns to the DOP. The process 
starts with a demand for a part by the end user. The end-user’s requirement is 
satisfied by RFI assets on hand at the time of the requirement, or by new/ 
repaired material received into the supply system after the requisition has been 
backordered. Replenishment of stock levels primarily comes from the repair of 
repairable items at the DOPs. 

The IM has facts or good estimates concerning the demand-carcass return 
process. For example the IM knows historical demand patterns, repair turn- 


around time (RTAT), program requirements, and inventory levels. He does not 








know, however, the number of carcasses he can expect to show up at the DOP 
for repair. The lack of this information affects the size of the new procurement 
contract since additional safety stock must be ordered to cover the uncertainty 
of the carcass returns and subsequent repairs.. 

The NRFI carcass should be shipped back to the DOP at the time the 
requisition is submitted into the supply system. There are numerous reasons 


for possible delays. For example: 


a) The ship may be out to sea and cannot return the carcass until it arrives 
in port. 


b) The carcass is left in a system, to allow the system to operate in a partial 
capability mode, and will be turned in upon receipt of an RFI asset. 


c) The carcass is turned over to a tanker during refueling in the Indian 
Ocean. The tanker will turn in the carcass upon return to overseas port. 


The item manager can review lists of every outstanding requisition, 
wholesale system RFI assets available for issue, and wholesale system NRFI 
assets in or available for induction into the repair process. Parts held onboard a 
ship, or in the transportation pipeline are “invisible” (not on the SPCC’s files) 
and difficult to track. The best that an IM can do, is to make an intuitive guess 
as to the number of NRFI carcasses that will arrive at the DOP in a given time 
period. 

As a preliminary step to modelling carcass returns, the next chapter 
discusses the structure of the raw data used for this thesis, establishes the data 
fields and variables, and examines some of the basic relationships between the 


variables. 








II. DATA OVERVIEW 


This chapter begins with a description of the SPCC repairable item data 
tape, and examines some of the relationships between data elements/ variables. 
Data highlights from the preliminary review and grooming of the data set will be 
discussed. The third section covers the initial examination of the correlation 


between the variables in the data set. 


A. DATA FILE DESCRIPTION 

The file used for purposes of this thesis was extracted during March 1991 
by the Fleet Material Support Office, Mechanicsburg, PA, and contained eight 
past quarter data for all 7G, and 7H cog items. The data elements contained on 
the tape were (listed with data element number (DEN), the number SPCC uses 


to standardize reference to the data fields in its major files): 


Field Name Card Column DEN Type 
NIIN 1-9 D046 Character 
4 Digit COG 10-13 Character 
Mark 14 Integer 
FGC 15-18 COO1A Character 
SMIC 19-20 Character 
LRC 21-23 BO002 Character 
Past Quarter 24 Integer 
Procurement Leadtime 25-28 BO11G Real 
Production Leadtime 29-32 B010G Real 
Demand 33-41 A005 Integer 





Carcass Returns 42-50 A005B Integer 
Comm Repair Qty 51-54 BO12L Integer 
Comm Rep Days 55-58 BO12K Integer 
Navy Repair Qty 59-62 B012G Integer 
Navy Repair Days 63-66 B012H Integer 
Rep TAT 67-71 BO12E Real 

Unit Price 72-81 B053 Real 
Replacement Price 82-91 BOSS Real 
Repair Price 92-101 BOS5A Real 

Net Price 102-111 B059 Real 
Blank 112-115 

MCC 116 C003A Character 
FSC 117-120 C042 Character 
Wearout Rate 121-124 F007 Real 

Rep Survival Rate 125-128 F009 Real 


This datafile contained 79,384 usable NIINs, with up to 8 past quarter 
observations per record. The zero demand items (no demand in the past two 
years) were discarded and considered of no concern since the vast majority of 
these items also had zero carcass returns over the same period. There are 
27,731 items that experienced at least one demand over the two year period. 
This data set will hereafter be referred to as DEMAND. The past quarter 
demand and carcass return observations for a given NIIN will be labelled DM1 


through DM8 and CR1 through CR8 respectively. In this case the higher the 


number assigned, the older the variable. For example DM§8 is the variable 








containing demand information from the eighth past quarter, while CR2 contains 
carcass return data from the second past quarter. 

The demand and carcass return processes for the majority of the NIINs on 
file are in a relatively steady state. This means that a snapshot of the 
observations can be expected to reveal that the two year sum of carcass 
returns is approximately equal to the sum of the demands. This is generally the 
case, however, 980 items (of the DEMAND file) met the condition that 
carcass returns minus demands was greater than or equal to twenty-five and 
were removed. This possibly indicates that the item is being phased out, or 
there is a modification program in effect for upgrade of the component. In the 
creation of the DEMAND file a few data errors and unreasonable demand or 
carcass return observations were noted. Specifically, items with non-numeric, 
or extraordinarily high numeric (ie; 999999999) demand and/or carcass return 
observations, were extracted and not used in any of the analysis. Twenty two 
of these items were removed leaving the DEMAND file with 26,729 items. 


For the remainder of this thesis the following conventions will be followed: 


1) Each model will attempt to forecast CR1, and the forecasted value will be 


denoted by CRi. DM1 and CR1 cannot be used for prediction purposes 
since CR1 must be reserved for the evaluation of the effectiveness of the 
forecasts. In actual practice, however, both variables could be used 
leading to increased accuracy of the models. 


2) DM2 - DM8, and CR2 - CR8 will be considered independent variables, 


although it will be necessary to forecast CR2 and use the model 
parameters to obtain an estimate for CR1. 


B. DATA HIGHLIGHTS 
Some preliminary graphical data analysis on a sample of 323 items 


indicated that all of the count variables, DMi and CRi , have similarly shaped 


10 











histograms. Figure 2 shows two representative examples, constructed from the 


items contained in the DEMAND file. 


Carcass Return Frequency Demand Frequency Distribution 
300 CR1, Combined Data 300 DMS - Combined Data 





0 6 122 8B Hw WD BH 4 48 0 6 2 BA DW BW 4 & 
range Tange 


Figure 2 - Representative Variable Histograms 

The spike on the left side of each graph is due to the large number of zero 
observations for CR1 and DMS. This is representative of all of the past quarter 
DMi and CRi variables. As a starting point to investigate the differences in the 
types of items included in the file, items were categorized by the quantity of 
demand observed. This categorization was made to investigate characteristics 
of items with different demand patterns. Although the MARK Code could be 
used to segregate demand categories it also incorporates the extended money 
value for quarterly demand and thus, would not be appropriate to use for this 
analysis. The categories defined for the remainder of this thesis are: 


Combined - All items included in the categories below. 


High - Demand 2 10 for the two year period. 
Med - 4< Demand < 10 for the two year period. 
Low - Demand < 4 for the two year period. 


The new data sets contained 5784 high (21.6%), 4770 medium (17.8%), 


and 16175 (60.6%) low demand items. A random sample of 323 items was 
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drawn and will be used for analysis. The sample’s characteristics are shown in 
Table 1 below. 
TABLE 1 - RANDOM SAMPLE CHARACTERISTICS 


Except for the range and dispersion of the variables, the general nature of 











the histograms did not change using the segregated data sets. Representative 
graphs are shown in Appendix A, Figures Al -A3. 

All of the demand and carcass return variables, regardless of the demand 
category, contain a significant number of zeros. This complicates portions of 


the analysis and will be discussed later. 


C. CORRELATION 

A preliminary estimate of the correlation between the variables was taken 
using Pearson product moment correlation. The correlation matrix is presented 
in Table - 2 and can be seen on the next page, where each entry is the Pearson 
correlation between the row and column variable for that entry. 

No significant patterns can be seen between carcass return variables with 
a one period lag, demand variables with a one period lag and carcass return- 
demand variables with a one period lag. All of the correlation figures are 
relatively high and do not really clarify the relationship between CRI and the 


other variables. From Table 2, earlier observations such as DM8 and CR8 


could as easily be related to CR1 as DM2 and CR2. This does not intuitively 








make sense, since the later carcass returns and demand observations should 


should exert greater influence on the dependent variable CR1. 


TABLE 2 - PEARSON PRODUCT-MOMENT CORRELATION, 


COMBINED DATA 

DM1 DM2 DM3 DM4 DMS DM6 DM7 DM8& CRI CR2 CR3 CR4 CRS CR6 CR7 
DM1 1.000 
DM2 0.825 1.000 
DM3 0.684 0.797 1.000 
DM4 0.555 0.661 0.753 1.000 
DMS 0.687 0.796 0.879 0.714 1.000 
DM6 0.675 0.797 0.864 0.748 0.854 1.000 
DM7 0.716 0.801 0.814 0.673 0.796 0.831 1.000 
DM8 0.581 0.640 0.721 0.598 0.722 0.752 0.661 1.000 
CR1 0.849 0.809 0.617 0.526 0.580 0.617 0.683 0.629 1.000 
CR2 0.661 0.637 0.688 0.556 0.576 0.630 0.654 0.537 0.654 1.000 
CR3 0.299 0.338 0.383 0.347 0.357 0.351 0.322 0.324 0.312 0.367 1.000 
CR4 0.670 0.712 0.801 0.740 0.758 0.800 0.712 0.654 0.604 0.692 0.434 1.000 
CRS 0.773 0.785 0.817 0.685 0.804 0.835 0.759 0.737 0.717 0.711 0.408 0.864 1.000 
CR6 0.600 0.738 0.751 0.633 0.705 0.850 0.809 0.739 0.674 0.641 0.344 0.730 0.791 1.000 
CR7 0.622 0.684 0.712 0.655 0.675 9.743 0.715 0.763 0.690 0.617 0.404 0.711 0.750 0.754 1.000 
CR8 0.654 0.739 0.775 0.613 0.780 0.808 0.759 0.638 0.523 0.508 0.273 0.682 0.661 0.678 0.570 


For a given NIIN and time, the demand-carcass return process has a finite 
number of RFI and NRFI carcasses in the supply system. Since this is a 
closed-loop demand-carcass return process, a high degree of correlation and 
collinearity exists between the most of variables. The correlation matrix 
illustrates this point, but is only an indication of pairwise correlation [Ref 3:pp 
328-9]. Although it is logical to assume that the first past quarter demand and 
carcass returns have more importance than the earlier observations, the 
correlation matrix indicates that there is a high correlation between the CRI, 
CR2 and the earlier quarters. This is due to the nature of the data, and can be 
misleading. It seems logical, however, that there would be a one to two quarter 
lag between demand and carcass return observations. This will be further 


examined in the next chapter. 
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D. MODEL COMPARISON 

This thesis requires a yardstick of performance of future forecasts. For 
estimating the relative effectiveness of these forecasts, mean absolute deviation 
(MAD) and Bias will be used. These measures are commonly used to 
evaluate forecast models and, if necessary, adjust the model parameters. MAD 
is an attractive indicator because SPCC currently uses MAD when forecasting 
demand, which ultimately impacts the calculation of safety stock. Bias will not 
be used to select the best model, but rather to evaluate model tendencies. The 


formulas in terms of this problem are: 


n n 
¥, |cR1;-CRI| ¥ (CR1-CR1}) 
MAD =! , and Bias = =! 


n n 


where n is the number of items in the sample, CR1; and CRij are the actual 
observation of carcass returns for NIINj and the forecast value of carcass 
returns for NIINj respectively, for the first past quarter. 

The MAD will illustrate the size of the forecast error and the Bias will 
indicate its direction. The MAD, as defined above, is the average error of the 


individual forecasts of the model. (Ref. 4:p. 45] 
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IV. FORECASTING TECHNIQUES 


This chapter attempts to use regression analysis and exponential smoothing 
to model the carcass return process. A discussion of pros and cons of each 


method are discussed. 


A. REGRESSION 

The first attempt to model the carcass return process was done using the 
four sample files described in Chapter II. Regression was an attractive 
method because it could take into account all of the past demand and carcass 
return observations, while minimizing the squared error term. Another 
advantage is that regression is efficient and requires few computer resources. 
Initial regression attempts used all demand and carcass return variables other 
than the first past quarter to predict CR1. Obviously, DM1 could not be used for 
prediction purposes, since DM1 and CR1 were observed simultaneously. The 
intent was to use all of the information available to be able to forecast CR1 with 
the least amount of error. 

It became clear that in order to evaluate the forecasts, the model would 
have to be fit with the second past quarter as the dependant variable, and then 
use the regression coefficients obtained to predict the first past quarter. This 
approach restricts the number of variables available for parameter selection to 
CR3 - CR8 and DM2 -DM8. In actual practice CR1 would be the dependant 
variable and all other variables would be used to pick the regression 


coefficients. Thus the demand and carcass return data from the third to eighth 
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past quarters were used to obtain coefficients to predict CR2;, as shown in the 


following equation: 


ER2; = By+B,CR3;+B,DM3;+. . . 


+B, 1,CR8;+B}2.DM8; 
where Bo, Bi,.-., Bi2 are the regression coefficients. 


Using the coefficients from the regression, CR2j becomes: 


ERi; = Bo+B1CR2+BsDM2;+ . . . 
+B,,CR7 +B DMZ; 


This forecast simply moves the regression coefficients one period forward 
and uses CR2; instead of ER3;, since CR2j; would be have been observed and 
available for the prediction of CRij . In actual practice it would not be 
necessary to do this since CR1 would be known, and could be used in place of 
CR2, to predict CRO. This should increase the model’s predictive capability. 

As a preliminary step to the regression of the combined and _ individual 
demand category samples, a regression was run using SAS on all 26,729 items 
in the DEMAND file. Table 3 is the result of this regression and indicates that 
33.46% of the total amount of variation was explained by the independent 
variables. Checking the residuals, with the Kolmogorov D statistic provided in 
SAS Procedure UNIVARIATE, yielded a p-value (Ho:€i~NID(0,02))< .01 [Ref. 
S:p. 1187], which means that the normality assumption concerning the residuals 


does not hold and that regression may not be appropriate. Examination of the 
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residuals for the 26,729 stock numbers in the DEMAND file is impractical, but 


can easily be accomplished for the smaller samples. 
TABLE 3 - DEMAND FILE REGRESSION RESULTS 


Dependent variable is: CR2 
R2 = 33.46% _R2 (adjusted) = 33.43% 


s= 2.37 with 26729-13= 26716 degrees of freedom 


Source Sum of Square df | Mean Square _ F-ratio 


Regression 76619.43 12 6384.95 1136.7 
Residual 152378.36 26716 5.52 











The next step was to regress the combined and individual category samples 
and examine the output. Table 4 shows that more of the variation found in the 
combined sample can be explained by the regression equation. The regression 


TABLE 4 - COMBINED SAMPLE REGRESSION RESULTS 
Dependent variable is: | CR2 


R2 = 60.1% R2(adjusted) = 58.6% 


$= 3.117 with 323 - 13 =310 degrees of freedom 





















Source Sum of Square df Mean Square _‘F-ratio 
Regression 4545.23 12 379.0 39.0 





Residual 3012.22 310 9.7168 


MAD Regression 1.461 
MAD Naive 9412 


tables and normal probability plot of the residuals for the segregated category 














data set can be found in Appendix B. 
The normal probability plot shown in Figure 3 confirms that the residuals are 
indeed non-normal. To investigate further, Figure 4 plots the externally 


studentized residuals versus the predicted values obtained in the regression. 
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This is a better method to examine the residuals, since they are standardized 


by the estimated standard deviation of the errors with the ith point deleted. 
Figure 4 shows that as the value of the prediction increases, the dispersion of 
Normal Probability Plot 
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Figure 3 - Probability Plot, Combined Data, Residuals 
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Figure 4 - Studentized Residuals used to Examine Variance of €; 
the errors also increases indicating non-homogeneous variance of the errors 
[Ref 6:pp. 437-9]. There are two predictions with unusually large error terms 
indicative of outliers. Upon closer examination the two points with the largest 
value of studentized residuals have unusual patterns of carcass returns and 


demand. Both observations have one past quarter observation where carcass 
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returns is unexpectedly high indicative of a modification or a RFI unit turn-in 
request from the ICP. 

R2 for the Low and Medium demand regressions were 27.6% and 19.8% 
respectively. In both cases the errors were obviously not normal. R2 for the 
High demand category was better at 58.7%, and the errors appear to be 
normally distributed which satisfies the regression assumptions. It is not 
unexpected that the amount of deviation and R2 is highest for the High demand 
data since the High demand category has the most variation in the CR1 
observations, and defines a much wider range of possible observations than the 
other two categories. Logically, a deviation of one unit is more significant for 
the Low and Medium demand categories than for the High category. The 
MAD in each case is better for the naive model than the regression model . 

At this point it does not seem reasonable to use regression for forecasting 
carcass returns for any category NIINs. The results for the combined sample 
category items can be slightly improved by adding category variables indicating 
demand category to the independent variables used in the regression. The 
results here are disappointing: R2 became just 60.2%, and the model MAD was 
lowered to 1.452. 

To gain insight into the relationship between the variables, stepwise 
regression was run for all of the above data sets. As seen in the results of the 
correlation evaluation in Chapter III, it was not totally surprising to find that 
there was no predictable pattern conceming the variables which were retained 
in the model. To forecast CR2 for the Combined sample the model retained 
DM3,DM5 , DM7, CR4, and CRS. The variables retained in the stepwise 


regressions on the segregated data sets were: High- CR3, and CR5, Med- DM7, 
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and DM8, and Low- DM6, and CR3. Therefore, based on these results, it is 
difficult to access the exact nature of the relationship between demands and 
carcass returns. 

The difficulty with using regression to predict carcass returns is that it not 
only requires a good size sample, but the regression equation is fit to a wide 
variety of items. This and the fact that the normality assumption is so clearly 
violated is most likely why regression does not provide an adequate forecast of 
carcass returns. Attempts to get a better fit by using the covariates listed in 
Chapter III, Section A. (eg. wearout rate, Federal Supply Classification, 
MARK code, etc.) were not successful. Until a better method is available to 
help group like items, regression will probably not be a viable method for 
modelling carcass returns. For these reasons, it was decided not to further 
refine the regression model (eg. introducing higher order terms or fitting a 


generalized linear model). 


B. EXPONENTIAL SMOOTHING 

Exponential smoothing is a common forecasting technique, which SPCC 
currently uses to forecast demand patterns. This technique makes no 
assumptions about linearity or normality and, in the right situation, can be very 
powerful. Trend and demand can be incorporated into the model, with simple 
modifications. It was decided to use exponential smoothing rather than an 
autoregressive or other time series model because of the nonlinearity and the 
fact that each series is so short. 

Exponential smoothing forecast are typically updated recursively using the 
current observation of the dependent variable. This allows the model to react 


quickly when there is a change in the level of the forecast variable. (Ref. 7:pp. 
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60-1] Typically, a forecaster selects parameters suited for his/her purpose and 


only changes them when the observations warrant. The approach here is 
somewhat different, in that the MAD is used to select the best set of 
parameters in order to forecast the coming quarter. If the model was run in 
two subsequent quarters it is quite possible that a different parameter set would 
be selected for the same NIIN. Remember, that the model for this thesis will 
forecast CR2, and use the MAD minimizing parameter set to forecast CR1. In 
actuality, the model would use all eight past quarter demand and carcass return 
observations to obtain the parameters to forecast CRO. 


The model in use here is: 


GRi=8(aCRi+1+(1-c)(CRi+1+Trendi+1)) 
+(1-5)Dlit1, i=2,3,...,8 
where 


Trendi+1=B(CRi+1-CRi+2)+(1-B)Trendi+2. 


and 
Dli=yDMi+1-(1-y)DIi+2_ 

Since the time units are past quarters, the higher the subscript, the older the 
observation or component of the equation. The parameters a, B, 5, and y, must 
be between zero and one, and determine how much weight or emphasis is put 
on the different variables in the above equations. For example, as the value of 
a increases, CRi+1, the latest value for CR in time period i+1, becomes more 
important to the forecast. Accordingly, the last forecast and the Trend factor 


are de-emphasized. 
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Unlike regression, this model forecasts. carcass returns on a NIIN by NIIN 
basis. This could prove to be very useful for the item manager’s carcass return 
quantity predictions for total costs extensions on repair contracts. Additionally, 
with the advent of higher speed mainframe computers, SPCC could run the 
algorithm for items individually, or as a group. The major advantage to 
exponential smoothing forecast models is that the entire history of an item is 
contained in the current forecast [Ref. 4:p. 53]. 

This technique allows for the past history of CR and DM to be used in the 
prediction of of future values of CR. This exponential smoothing model assumes 
that part of difference between the forecast and actual observation is due to 
demand, trend, and level with the remaining difference attributable to random 
chance. Optimum values for a, B, 5 and y can be found by employing the 
parameter search algorithm written for this purpose and presented in Appendix 
C. Since there are only eight past quarter DM and CR values, the first period 
CR and DM observations are used as the first period CR8 and DI8 respectively. 
The model sequentially generates forecasts for all of the previous periods and 
picks the parameter set with the lowest MAD averaged over periods 7 through 
2. The model then forecasts CRi, and calculates the MAD for the particular 
NIIN. This figure will be used for comparison to the other methods. 

To adhere to the integer nature of the data, recognizing that fractional 
values are unrealistic, two rounding schemes are examined. The first technique 
is to round up to the next integer if the fractional part of the forecast is greater 
than or equal to 0.5. This is simply standard rounding practice and can be 
expected to influence the MAD calculation for a given sample. The second 


technique is to round up if there is any fractional remainder. This method has 
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been employed in various inventory situations, with the understanding that it 
will yield a conservative forecast for the amount of stock necessary to meet 
demand. If MAD is the criteria, however, this practice will increase the 
residuals of the forecasts. 

In actual use there are two methods for picking the initial forecasts for 
exponential smoothing models. If sufficient data is available, it would be 
appropriate to segregate the data into two sets, use the first set to select 
parameters, and then use the second set to gauge the model’s effectiveness 
[Ref. 4:p. 56]. For example, if a business had two years of weekly demand data 
available before its initial attempt to utilize exponential smoothing to forecast, 
the first year’s data could be used to adjust the parameters. The forecaster 
could evaluate several different levels of the smoothing coefficients until he was 
satisfied he had a set that adequately described his demand pattern. He would 
then use the selected parameters and see how accurate the model predicted 
the second year’s data. If necessary he would go back and pick a new set of 
parameters. 

This particular case, however, has only 8 data points and the above method 
would be impractical. Thomopoulos suggests that the first observation for the 
forecast variable serve as the initial forecast when few data points are available 
[Ref. 6:p. 61]. This was exactly what was done, using CR8 for ER8, DM8 for 
DI8, and zero for Trend8. 

Initially the model was run exhaustively for all possible parameter 
combinations between zero and one. o and 6 were incremented in steps of .05, 


and B and y by .01. Each NIIN starts with the parameter seta =Bp=5= y 


= 0.0. When the CR? is determined, a MAD is calculated using the actual 











observation for CR2. This value is the compared to the variable MADLOW, 
which keeps track of the lowest MAD recorded for that NIIN. If the new 


MAD is lower than the current MADLOW value, the parameter set and 
forecast are recorded. The loops of the program increment a, B, 5 and y in 
turn, and calculate a MAD for each possible combination of parameters. 

Hillier and Lieberman suggest guidelines to keep & and 6 within the range 
of .1 to .3 to prevent over-smoothing of the data [Ref. 8:p 747]. The model was 
initially run using the full range (zero to one) for all four parameters, and then 
run restricting « and 5 within the suggested limits. Further references to these 
models will be as either the full range or the restricted range models. As 
expected, the full range model picked the ER2 MAD minimizing parameter set 
for each individual NIIN, however, the average MAD for the entire sample 
(and demand category samples) was higher than the average MAD in the 


restricted range model. This will be discussed in the next section 


C. COMPARISON OF RESULTS 

Detailed output listings for both the full and restricted range exponential 
smoothing models are contained in Appendix D. Table 5 is a description of the 
column headings for the output. Included in the listings are the MAD 
minimizing parameter sets, the first period forecast, naive forecast, regression 
forecast, MAD for each forecast method and the MAD by which the model 
selected the parameters. The bottom of each table summarizes the significant 
data for each category sample. These summary statistics are averaged over 
the number of NIINs in each category. MAD is presented for each forecasting 


technique and rounding method, but Bias is included for the Naive method and 
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the standard rounding of the exponential smoothing and regression forecasts 
only. Accursory review of the summary statistics in each table illustrates that 


TABLE 5 - APPENDIX D OUTPUT DESCRIPTION 


LO 1 = Low category, 0= High or Med 
Forcst 1st pd Exponential smoothing forecast, CRij 
CR2hat Best MAD MAD value for parameter selection 
MAD Ist Pd CR1;- CR1i 


MAD Regress MAD of unrounded regression forecast 


MAD Rg StdRnd/RndUp MAD Regress Rnd Std and Rnd Up 


the “Round Up” method of integerizing the forecasts increases the MAD over 




















that of the Naive method and does not merit further review. 

A comparison of the “CR2hat Best MAD” in Tables D1 and D2 shows that 
although the full range model picked a parameter set with a lower MAD, the 
category MAD summaries at the end of the tables favor the restricted range 
model in every case. This is indicative of an over-fitting of the data which 


introduces additional error when predicting CR1. Further discussions and 
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comparisons involving exponential smoothing will refer to the restricted range 
model/output using the standard rounding method only. 

Table 6 presents extracted data from Table D2. Notably, the MAD for the 
exponential smoothing model is lower than that of the Naive or regression 
models for the combined demand, and each individual demand category. The 
surprising finding is that the Naive model performed reasonably well, across all 
demand categories. 

In terms of the direction of the forecasting errors, the Bias term in Table 6 
indicates that while the exponential smoothing model had a tendency to slightly 
under-forecast, the other two methods over forecasted. The fact that the 
exponential smoothing and naive model biases were nearly zero, means that the 
positive and negative errors nearly cancelled and, adds little information by 
which to judge effectiveness of forecasts. The regression model bias, however, 
was somewhat larger and negative and, shows a tendency to over-forecast not 
present in the other two models. The sample variance of the forecast errors 
indicates that the variation of the errors for the exponential smoothing and naive 
methods are somewhat lower than that of the regression model. 

The next issue is to determine if there is statistical significance to the 
differences of the MADs of the different models. It is a simple matter to 
construct a (1-a«)100% confidence interval of the absolute difference of the 
forecast errors. The null hypothesis is that there is no difference in the means 
of the forecast errors, with the alternative being that the null hypothesis is false. 
Constructing confidence intervals for @ = .05 and a = .01 indicates whether 
there is a Statistical difference in the methods at significance level a. If the 


confidence interval contains zero there is no significance at level @, and the null 


26 














hypothesis cannot be rejected. The standard presentation of the two-sided 
hypothesis test is shown below. Although this test is structured using the 


TABLE 6 - COMPARISON OF FORECASTING TECHNIQUES 
Category Method MAD Bias S2airt 


Regression 1.461 - 0.656 10.793 


Regression 3.430 - 1.312 18.248 


Regression 1.968 - 1.246 26.824 


Regression 0.505 - 0.212 0.416 





exponential smoothing and naive methods, all combinations of forecasting 
methods will be tested. 
Formally: 

Ho: HES - UN =0 

Ha: HES - UN #0 
where HES and LN represent the expected MADs the the respective methods. 
The confidence interval is constructed using the formula 

Hes-lin + Zay\\ Shit 
where [es-Hn is the difference in the MADs of the exponential smoothing and 
naive methods respectively, and Sditt is the sample variance of the difference of 


the forecast errors. 
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The resulting (1-a)100% confidence interval (-0.094,0.259) indicates that 


the null hypothesis cannot be rejected and that the difference in means is not 
significant at a=.05 [Ref. 9:p. 372]. Confidence intervals, p values, and 
hypothesis test conclusions for each demand category are presented in Table 7. 

In each case, the null hypothesis is the of the same form, with the 
substitution of the means being tested. The conclusions are based on a 95% 
confidence interval on the difference in means. The p values in Table 7 indicate 
that with the exception of the Medium category the null hypothesis can be 
rejected for the tests concerning the differences between regression and the 
other methods and that there is no statistical difference between exponential 
smoothing and the naive method. Except for the size of the confidence 
intervals, the same results hold for a = .01 and are not presented here. 

Figure 5 plots the forecast errors of the naive and the exponential 
smoothing models and shows that the two methods generally err in the same 
direction. With only a few exceptions the errors lie around the line y=x. 

Using the the confidence interval formulation above, setting the lower limit 
equal to zero, and solving for n (using the standard deviation of the set of 
differences) indicates the sample size necessary for the there to be a statistical 
significant difference between the exponential smoothing and naive methods of 
forecasting. For the combined sample this calculation indicates that a sample 
size of 1,491 items are necessary for the models to be statistically different. 


The formulation for n is: 
n= Gears 2 
HeEs-| ; 
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A second random sample of 2,100 NIINs was drawn to verify if this is 
indeed the case. The new sample size was selected to be conservative and 


account for any possible differences in variance between the samples. Since 


TABLE 7 - METHOD COMPARISON HYPOTHESIS TESTING 
DIFFERENCE IN METHODS A =.05 


Combined ES-Nav__ (-.094,.259) 362 cannot reject Ho 
High ES-Nav = (-.453,.824) 570 cannot reject Ho 
Medium ES-Nav__ (-.189,.476) 377 cannot reject Ho 
Low ES-Nav __ (-.044,.088) 521 cannot reject Ho 


there was Statistical difference between regression and the other model, 



















regression will not be further tested. 

Due to the size of the sample, the program in Appendix C was modified to 
reduce the output. The modified program keeps track of the cumulative 
differences and the cumulative squared differences between the two methods. 
This allows for the computation of the mean and variance of the differences. 
The program output and conclusions are summarized in Table 8 below, and the 
modified program can be found in Appendix E. 


Table - 8 confirms that the techniques are indeed different with the larger 


sample. Once again, the results for @ = .01 lead to identical conclusions and 











are not presented here. The conclusions in Table 8 only indicate that there is a 


difference in the two methods. The results of the sample of 323 were certainly 


Exp Smoothing vs Naive 
15 Forecasting Errors 





Naive Method 





-15 -10 -5 Oo. § 10 15 
Exp Smoothing 


Figure 5 - Forecasting Error Comparison 
disappointing, and dilute the dramatic impact of the difference in the methods 


revealed by the larger sample. 


TABLE 8 - EXPONENTIAL SMOOTHING - NAIVE METHOD 
HYPOTHESIS TESTING DIFFERENCE IN METHODS 
a =.05 


Category WES-UN S2aitt CI p Conclusion 
Combined 0.131 1.048  (0.174,0.086) .000 reject Ho 


High 0.346 2.448 — (0.487,0.204) .000 reject Ho 


Medium 0.156 0.496 (0.228,0.084) .000 reject Ho 


Low 0.041 0.147 (0.062,0.020) .000 reject Ho 
















Although it was necessary to increase the sample size to distinguish 
between exponential smoothing and the naive models, the results are 


encouraging. This small difference between the models has practical as well as 
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statistical significance. The next chapter presents a summary of the thesis and 


presents conclusions and recommendations for further research. 








V. SUMMARY, CONCLUSIONS, AND 
RECOMMENDATIONS 


A. SUMMARY 

This thesis has examined the relationship between demand and carcass 
return of repairables at SPCC, and attempted to differentiate between three 
forecasting methods. Each NIIN had only eight observations for demand and 
carcass returns, and it was difficult to establish a relationship based on time, 
since all of the past quarter variables appeared very similar and contained 
numerous zero observations. Regression, exponential smoothing, and the naive 
method of forecasting were used to model carcass returns. A comparison of 
the results indicated that although it was difficult to distinguish between 
exponential smoothing and the naive method, they each performed better than 
regression. A larger sample was drawn to reassess the two models with 


exponential smoothing out-performing the naive method. 


B. CONCLUSIONS 

SPCC’s procurement budget is huge and driven, in part, by the 
unpredictability of the demand - carcass return process. SPCC has statistical 
controls on demand, repair survival rate, wearout rate, RTAT, but uses only the 
demand observation from the past quarter to predict the number of carcasses 
that will arrive at the DOP for repair. The IM bases his buy decision on the 
anticipated program growth or decline, and the total number of carcasses, RFI 
and NRFI, in the system. The accuracy or inaccuracy of any forecast of 


carcass returns at the DOP obviously affects the number of repairables required 
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to support a given system at the system command prescribed readiness rates. 


Generally, the lack of accuracy of any inventory forecast translates into 
sucreased safety stock, if the readiness goals are to be achieved. 

The extended dollar value of SPCC’s annual buys for repairable repair parts 
is huge, being valued at over $300 million. It is difficult to access what 
percentage of these buys are directly attributable to the inaccuracy of the naive 
forecast driving increased safety stock, but it is apparent that even a small 
improvement can save a significant amount of money. The difference in the 
means of the forecast errors of the exponential smoothing and naive method 
(for the sample of 2100) was 0.131, which extended across the number of 
SPCC managed repairables experiencing at least one demand in two years 
(27,731 for purposes of this thesis), could mean an important savings of 


procurement dollars in an era of fiscal awareness. 


C. RECOMMENDATIONS 

The results of this thesis imply that it is possible to improve the forecasting 
of carcass returns at the DOPs, potentially saving money in the process. 
Conservatively, the use of the exponential smoothing model proposed involves 
no more risk than the naive model for forecasting carcass returns. This model 
should be further tested in the real life environment at SPCC. It could be easily 
incorporated to the IM Workbench, which is a statistical program by which IMs 
can Statistically analyze a NIIN’s demand pattern in order to calculate 
perspective buys. 

Attention should be place on evaluation of the difference in performance of 
the methods with full eight period data compared to the seven period data used 


for this thesis. The addition of the extra observation would add to the accuracy 
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of exponential smoothing and do nothing for the naive model. This would 


differentiate the models even further. 
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APPENDIX B 


Regression Output and Plots of Residuals for the Categorized Data 
Sets 


TABLE Bl - HIGH CATEGORY DATA SET REGRESSION 
OUTPUT 


Dependent variable is: CR2 
R2 = 58.7% R2(adjusted) = 51.0% 


s= 6.214 with 77-13 =64 degrees of freedom 


Source Sum of Squares = df Mean Square F-ratio 
Regression 3517.96 12 293.0 7.59 


Residual 2471.21 64 38.6126 


MAD Regression 3.911 
MAD Naive 2.506 
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Figure B1- Probability Plot, High Data, Residuals 











TABLE B2 - MED CATEGORY DATA SET REGRESSION 
OUTPUT 


Dependent variable is: CR. 
R2= 19.8% R2(adjusted) = -2.1% 


s= 1.254 with 57-13 =44 degrees of freedom 


2 
Source Sum of Squares df Mean Square F-ratio 
Regression 17.0433 12 1.420 0.903 


Residual 69.2023 44 1.57278 


MAD Regression _ 1.653 
MAD Naive .9824 


Normal Probability Plot 
Residuals, Med Data 
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Figure B2 - Probability Plot, Med Data, Residuals 


38 





TABLE B3- LOW CATEGORY DATA SET REGRESSION 
OUTPUT 


Dependent variable is: CR2 
R2=27.6% R2(adjusted) = 22.7% 


s= 0.9172 with 189-13 = 176 degrees of freedom 














Source Sum of Squares _—_ df. Mean Square F-ratio 
Regression 56.5824 12 4.715 5.60 
Residual 148.063 176 0.841268 


MAD Regression __.4385 
MAD Naive .2910 


Normal Probability Plot 
Residuals, Low Data 
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Figure B3 - Probability Plot, Low Data, Residuals 
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APPENDIX C 


Exponential Smoothing Parameter Search Algorithm 


/fEXPLO02 JOB (9003,9999),'EXPLO02 FORTRAN',CLASS=J 
/[*MAIN SYSTEM=SY2 

// EXEC VSF2CG 

/[FORT.SYSIN DD * 


Ke 2 he fe he fe 2h he ie ae ie 2 2 fe 2h fe 2h 2h 2 a 2h 2h 29s fe fe fe 2 Fe fe fe 2Fe 2c ac 2 af afc 2c ake ake fe fe fe ake fe ake 2c afc fe fe fe a akc a ate 2h 2 2h 2c ak 2 a he 2 of 2k ok 2c 2k 2 


* Student: Robert B. Vassian, 15 July 1991, Thesis Program 
* Advisors: Prof. Lyn R. Whitaker, CDR David Wadsworth 
* 


* The purpose of this program is to exhaustively search for the MAD minimizing 

* parameter set for the carcass returns of the second past quarter, and then using 

* those parameters, forecast carcass returns for the first past quarter. The 

* program generates an output file for evaluation of the effectiveness of exponential 
* smoothing and the Naive forecasting policies. 

He He ee Fe 2 2 he he 2c ae he 2h 2c He he 2c fe 2h 2 e292 2 2 2c 2k He 2 he 2h ae 2h ae ae Re 2 a 2h ee 2h ae 2 2 a oe 2 ie 2 2 2h oe ke ie 2 ok ie ie kc 2 ak ic oe ake ok ke 


* ¥ y 8 RY YH *¥ 


CHARACTER NIIN(8)*9 
INTEGER CR(8), DM(8), PQ(8), MADACT 

INTEGER I,K,L,M,N,PQ,HI,LO 
REAL A,B,D,G,ALPHA,BETA,DELTA, GAMMA 
REAL FOR(9), TREND(8),DI(8),MAD,MADLOW, MAD6 


DO 35 K=1,63 
DO 20 I=8,1,-1 
READ(5,51) NIIN(),PQ(D,DM(D,CR(),HILLO 
20 CONTINUE 
D=.00 
DELTA =.00 
MADACT=ABS(CR(1)-DM(2)) 
MADLOW=1000.0 
DO 23 Q=1,21 
A=.00 
DO 30 L=1,21 
B=.00 
DO 25 M=1,101 
G=.00 
DO 22 N=1,101 
MAD6=0.0 
TREND(8)=0.0 
FOR(8)=CR(8) 
DI(8)=DM(8) 
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DO 10 P=7,1,-1 
FOR(P)= D*¥(A*CR(P+1)+(1-A)*(FOR(P+1)+ 
& TREND(P+1)))+(1-D)*(DI(P+1)) 
TREND(P)=B*(FOR(P)-FOR(P+1))+(1-B)* TREND(P+1) 
Di(P)=G*DM(P)+(1-G)*DI(P+1) 
IF(P.GT.1) MAD6 =MAD6+(ABS(CR(P)-FOR(P)))/6 
10 CONTINUE 
IF(MAD6 .LT. MADLOW) THEN 
MADLOW=MAD6 
MAD=ABS((CR(1)-FOR(1))) 
FORB=FOR(1) 
ALPHA=A 


WRITE(6,48) NIIN(1),HI,LO,CR(1), CON DELTA,ALPHA,BETA, 
& GAMMA,MADLOW,MAD,MADACT 
35 CONTINUE 
48 FORMAT(1X,A9,1X,11,1X,11,1X,14,1X,14,1X,F8.4,1X,F5.3,1X,F5.3,1X, 
& ¥F5.3,1X,F5.3,1X,F8.4,1X,F8.4,1X,14) 


49 FORMAT(A9) 
51 FORMAT(A9,I1,19,19,11,11) 
STOP 
END 

f* 

//GO.SYSIN DD * 
0102166078 0 001 64 
0102166077 0 001 64 
0102166076 1 101 64 
0102166075 0 001 64 
0102166074 0 001 64 
0102166073 0 001 64 
0102166072 0 001 64 
0102166071 0 001 64 
0114898428 0 001 125 
0114898427 1 001 125 
0114898426 0 101 125 
0114898425 1 101 125 
0114898424 0 201 125 
0114898423 0 001 125 
0114898422 1 001 125 
0114898421 0 001 125 
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0115320008 
0115320007 
0115320006 
0115320005 
0115320004 
0115320003 
0115320002 
0115320001 
i 

/ 


ooorcoo°o 


001 126 
001 126 
001 126 
001 126 
001 126 
001 126 
001 126 
001 126 
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APPENDIX E 


Modified Exponential Smoothing Parameter Search Algorithm 
for Large Sample Size 
//EXPPRG1 JOB (9003,9999),"EXPPRG1 FORTRAN’ ,CLASS=J 


//*MAIN SYSTEM=SY2 
/| EXEC VSF2CG 


//FORT.SYSIN DD * 
36 2 3c fc he kc ke ik 2c 2 fee oie a oie 2 2k 2 oC ie oe ie oe fe oe 2h oe 2k kc 2c ok 2c 2c ae 2fe fe 2c 2c 2c ke 2c ke oie 2 oie 24 2c ok 2k akc 2h a ie ake 2 oe oie oc oc 2 2c 2k 2 
* Student: Robert B. Vassian, 12 August 1991, Thesis Program * 
* Advisors: Prof. Lyn R. Whitaker, CDR David Wadsworth = 
* * 
* This program is the exhaustive search rouitine for the for the larger * 
* sample. The program finds the MAD minimizing parameter set for each * 
* NIN but only returns the cumulative differences and cumulative ca 
* squared differences. Calculations are also made for the naive method * 
* and are returned upon compltetion of the run. * 


2 he He ee he ee 2h 2h oe afe 2h 2c Fe fe 2c fe fe 2h 2c ie 2h 2k Fe 2 a fe 2 2 2 2c ac fe 2c fe 2 2c 2c oe ae 2 2 2k 2 2k fe 2c 2c 2 fe 2c ae a 2h ake ake fe 2k fe ok 2k 2 


CHARACTER NIIN(8)*9,NIN*9 

INTEGER CR(8), DM(8), PQ(8),,HI(8)MADACT, XI,XISQ,HXI,LXI,MXI 
INTEGER HXISQ,MXISQ,LXISQ,DIFF,DIFFH,DIFFM,DIFFL,MADES 
INTEGER I,K,L,M,N,P,Q,FORR,HIGH,LLOW,MED 

REAL FOR(9),A,B,D,G,ALPHA,BETA,DELTA,GAMMA,DI(8), TREND(8) 
REAL MAD,MADLOW,MAD6,MADN 

REAL NXI,NXISQ,HNXI,HNXISQ,LNXI,LNXISQ 

REAL MNXI,MNXISQ,MCH,MCHN,MCM,MCMN,MCL,MCLN 


Cc DO 15 T=1,14400 
¢ READ(2,45) NIN 
C 15 CONTINUE 
DO 35 K=1,300 
MADES= 0 
FORR= 
DO 20 I=8,1,-1 


READ(2, 51) NIIN(),PQ(D,DM(1),CR(),HICD 
20 CONTINUE 

D=.10 

DELTA =.10 

IF(HI(1) .EQ. 10) THEN 

HIGH = HIGH+1 
ELSEIF(HI(1) .EQ. 01I)THEN 
LOW=LOW+1 
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10 


Ze 
25 
30 
23 


ELSE 
MED = MED+1 
ENDIF 
MADACT=ABS(CR(1)-DM(2)) 
MADLOW=1000.0 
DO 23 Q=1,5 
A=.10 
DO 30 L=1,5 
B=.00 
DO 25 M=i1,101 
G=.00 
DO 22 N=1,101 
MAD6=0.0 
TREND(8)=0.0 
FOR(8)=CR(8) 
DI(8)=DM(8) 
DO 10 P=7,]1,-1 
FOR(P)= D*(A*CR(P+1)+(1-A)*(FOR(P+1)+ 
TREND (P+1)))+(1-D)*(DI(P+1)) 
TREND(P)=B*(FOR(P)-FOR(P+1))+(1-B)*TREND(P+1) 
DI(P)=G*DM(P)+(1-G)*DI(P+1) 
IF(P.GT.1) MAD6 =MAD6+(ABS(CR(P)-FOR(P)))/6.0 
CONTINUE 
IF(MAD6 .LT. MADLOW) THEN 
MADLOW=MAD6 
IF((FOR(1)-(REAL(NT(FOR(1))))).GE. 0.5) THEN 
FORR = INT(FOR(1)) +1 
MADES= ABS(CR(1)-FORR) 


FORR = INT(FOR(1)) 
MADES= ABS(CR(1)-FORR) 





ELSE 


ENDIF 
ENDIF 
G=G+.01 
CONTINUE 
B=B+.01 
CONTINUE 
A=A+.05 
CONTINUE 
D=D+.05 
CONTINUE 
MAD=MAD+REAL(MADES) 
MADN=MADN+REAL(MADACT) 
DIFF=ABS(ABS(MADES)-ABS(MADACT)) 
XI=X1I+DIFF 
XISQ=XISQ+DIFF**2 
IF(HI(1).EQ.10) THEN 
MCH = MCH+REAL(MADES) 
MCHN= MCHN+REAL(MADACT) 
DIFFH = DIFF 
HXI=HXI+DIFF 
HXISQ=HXISQ+DIFF**2 
ELSEIF(HI(1). 01) THEN 
CL = MCL+REAL(MADES) 


62 








MCLN = MCLN+REAL(MADACT) 
DIFFL =DIFF 
LXI=LXI+DIFF 
LXISQ=LXISQ+DIFF**2 
E 


MCM=MCM+REAL(MADES) 
MCMN=MCMN+REAL(MADACT) 


DIFFM=DIFF 
MXI=MXI+DIFF 
MXISQ=MXISQ+DIFF**2 
ENDIF 
35 CONTINUE 


WRITE(6,49) XI,XISQ,K, DIFF 

WRITE(6,50) MAD/(K-1), MADN/(K-1) 

WRITE(6,46) HXI,HXISQ,HIGH, DIFFH 
WRITE(6,50) MCH/REAL(HIGH),MCHN/REAL(HIGH) 
WRITE(6,47) MXI.MXISQ,MED, DIFFM 

WRITE(6,50) MCM/REAL({MED),MCMN/REAL(MED) 
WRITE(6,48) LXI,LXISQ,LOW, DIFFL 

WRITE(6,50) MCL/REAL(LOW),MCLN/REAL(LOW) 


49 FORMAT(1X,' XI = ',I9,' XISQ = ‘114, K = ‘I5,'D ', 15 ) 
46 FORMAT(1X,' HXI = ',19,' HXISQ = ',114,' HIGH = ‘I5,'D ', 15 ) 
47 FORMAT(1X,' MXI = ',I9,' MXISQ = ',114,' MED = ',I5,' D ', I5 ) 
48 FORMAT(1X, LXI = ',19,, LXISQ = ‘114,’ LOW = 'I5,' D', 15 ) 
45 FORMAT(A9) 
50 FORMAT(1X,,MADCUM = ',F15.4,, MADCUMN = ',F15.4) 
51 FORMAT(A9,I1,19,19,12) 
STOP 
END 
/* 


/{(GO.FT02F001 DD DISP=SHR,DSN=MSS.S9003.EXPSAM 
H 
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